User Guide > Concepts > Results Manager |
If an application needs to do more than simply export OCR results to a searchable text or PDF document, then the Internal Structured Data generated by a call to recognize() can be accessed via a set of "Get" functions that return arrays to result elements. The Get functions and the arrays returned, as a whole, are referred to as the Results Manager. The term, result element, is used to refer to any document, page, region, text block, text line, word, or character.
In order to use the Results Manager, it is beneficial to understand the relationship between the different layers of the Internal Structured Data. (See figures 1 and 2 in OCR Xpress for Node.js Functionality)
Note that for regions and text blocks there are no hard rules for segmenting them. There are many elements of an image that can affect how the two layers get segmented. Their primary use is for generalized segmentation or grouping of the more important sub-layers; text lines, words, and characters. |
The Results Manager allows applications to access the layers of the Internal Structured Data in a coherent manner. In short, applications can interrogate every item in every sub-layer of a page, every item in every sub-layer of a text block, and so on. Every result element can be interrogated for its content, its position in the image, or even the confidence that the OCR result is correct.
A particular result (e.g., a word) can be accessed in one of the following ways: